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Abstract 

Background: In the anaerobic pathway of cobalamin (vitannin B12) synthesis, the CbiT enzynne plays two roles, as a 
cobalt-precorrin-7 C15-methyltransferase and a C12-decarboxylase, to produce the internnediate, cobalt-precorrin 8. 

Results: The primary structure of the hypothetical protein IVIJOSQI, from Methonocoldococcus jonnoschii suggested 
that 1\/1J0391 is a putative CbiT. Here, we report the crystal structure of IVIJOSQI, solved by the MAD procedure and 
refined to final R-factor and R-free values of 19.8 & 27.3%, respectively, at 2.3 A resolution. The asymmetric unit 
contains two NCS molecules, and the intact tetramer generated by crystallographic symmetry may be functionally 
important. The overall tertiary structure and the tetrameric arrangements are highly homologous to those found in 
l\/lT0146/CbiT from Methanobacterium thermoautotrophicum. 

Conclusions: The conservation of functional residues in the binding site for the co-factor, AdoMet, and in the 
putative precorrin-7 binding pocket suggested that 1\/1J0391 may also possess CbiT activity. The putative function of 
1\/1J0391 is discussed, based on structural homology. 

Keywords: Cobalamin biosynthesis. Vitamin B^, Precorrin 7 C15-methyltransferase, Decarboxylase, CbiT, Crystal 
structure, AdoHcy cofactor, 1\/1J0391, Methanocaldococcus jannaschii 



Background 

The de novo synthesis of vitamin B12 (cobalamin) occurs 
via a complex biochemical pathway involving more than 
30 enzyme-mediated steps [1,2]. Therefore, many organ- 
isms do not use cobalamin as a coenzyme, and neither 
synthesize nor require the vitamin in their metabolism. 
Interestingly, only certain eubacteria and archaea 
synthesize cobalamin, and the pathway apparently never 
made the transition to eukaryotes [3]. Although they are 
genetically distinct, two related pathways are involved in 
cobalamin biosynthesis in prokaryotes: the aerobic and 
anaerobic pathways [4]. The aerobic pathway requires 
molecular oxygen and is characterized by the compara- 
tively late chelation of cobalt into the corrin ring [2,5-8]. 
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In contrast, the anaerobic pathway does not require mo- 
lecular oxygen, and the cobalt is inserted at an early 
synthetic stage [4,9,10]. For example, the pathway in 
Pseudomonas denitrificans, an aerobe, incorporates mo- 
lecular oxygen into the precorrin-3 macrocycle as a pre- 
requisite to ring contraction, and hence is classified as an 
aerobic pathway. The pathway in Salmonella typhimurium 
synthesizes cobalamin only in the absence of molecular 
oxygen. This pathway utilizes cobalt ion chelated into 
precorrin-2 (or Factor II) to set the stage for ring contrac- 
tion, and hence is classified as anaerobic. The biosynthesis 
of precorrin-2 is common to both the aerobic and ana- 
erobic pathways. At this stage, the two pathways divide, 
rejoining only after the synthesis of Cob(II)yrinate a,c 
diamide (Figure 1) [1]. 

The intermediate steps of the aerobic biosynthetic 
pathway, from uroporphyrinogen III (urogen III) to 
adenosylcobalamin, have been elucidated by the over- 
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Figure 1 The aerobic and anaerobic pathiways of cobalamin synthiesis. CobL in the aerobic pathway is colored red. CbiE and CbiT, which are 
equivalent to CobL, are colored blue and red with a boxed label, respectively, in the anaerobic pathway. The chemical structures of cobalt- 
precorrin-6B, 7 and 8, which are the substrates and products of CbiE and CbiT, are shown at the bottom of the figure. 



Co-precorrin 8 



expression of the cob genes from P, denitrificans (for 
recent reviews, see [2,11]), and the related assignments 
of the functions of the encoded enzymes, based on their 
catalytic activities. The detailed intermediate mecha- 
nisms of the anaerobic pathway for cobalamin biosyn- 
thesis are progressively emerging, through combinations 
of molecular genetic, biochemical and three-dimensional 
structural studies. The cbi genes involved in the biosyn- 
thesis of cobalamin by the anaerobic pathway have been 
over-expressed from not only S, typhimurium but also 
Bacillus megaterium [12], Propionibacterium freuden- 
reichi (P. shermanii) [13] and Methanocaldococcus 
jannaschii [14]. 

The functions of CbiA, -B, -C, -D, -E, -F, -G, -H, -J, -K 
(X), -L, -P, and -T in the anaerobic pathway were 
assigned, mainly based on the sequence similarity to 
their counterparts in the corresponding, well charac- 
terized aerobic pathway of P, denitrificans (Figure 1). 
Experimental evidence for the functions of CbiB, -H, -K, 
and -L, and for those of CbiA, -D, -F, -G, and -T, 
among the 13 catalytic Cbi proteins, was also recently 
reported [15-17]. However, only the crystal structures of 
CbiC [18], CbiF [19], CbiK [20], CbiL [21] and CbiT [22] 
have been solved. 

In the aerobic pathway, the enzyme CobL catalyzes the 
methylations at C-5 and C-15, and the decarboxylation of 
the C-12 acetate side chain of precorrin-6B. In contrast, in 
the anaerobic system, the functions of CobL are shared by 
two separate enzymes, CbiE and CbiT. Based on the 



sequence similarity, it has long been assumed that CbiE 
and CbiT are the methyltransferase and the decarboxylase, 
respectively: CbiE catalyzes the addition of the two methyl 
groups to cobalt-precorrin-6B, and CbiT then decarboxyl- 
ates the C-12 acetate side chain to afford cobalt-precorrin 
8. However, recent studies based on structural [22] and 
biochemical analyses revealed that CbiT possesses both 
the decarboxylase and methyltransferase activities; i.e., 
CbiT alone can catalyze both the methylation at C-15 and 
the decarboxylation of the C-12 acetate side chain, even in 
the absence of C-1 methylation [16]. 

The sequence analysis of the hypothetical protein 
MJ0391, from M, jannaschii (hereafter, referred to as 
MJ0391), suggested that it is closely related to MT0146/ 
CbiT, with 33% sequence identity. Our crystal structure 
analysis of MJ0391, determined by the multiple anomal- 
ous dispersion (MAD) method, suggested that MJ0391 
may possess the putative dual functional activities of 
decarboxylation and methyltransfer in the cobalamin bio- 
synthesis pathway. 

Results and discussion 

Overall structure of MJ0391 

The overall tertiary structure of the MJ0391 protein pos- 
sesses the Rossmann-like "AdoMet-dependent methyl- 
transferase" fold, comprising seven |3-strands (Figure 2a). 
A long flexible stretch with a non-specific secondary 
structure conformation is observed at the N-terminal 
region, Met4 - Thrl9. The methyltransferase fold is very 
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Figure 2 The structure of IVIJ0391. (a) Ribbon diagram (stereo view) of tine tertiary structure of MJ0391 (colored in a rainbow ramp from blue 
at the N-terminus to red at the C-terminus). (b) The ribbon diagram of the MJ0391 dimer in the asymmetric unit (shown in an arbitrary 
orientation). Subunit A is colored in a rainbow ramp as in (a), and subunit B is colored green. In the dimer, (36 of subunit A and (36' of subunit B 
form an intermolecular antiparallel (3-sheet. The MES molecule is shown as a stick model. All figures were produced with the PyMOL program 
(Schrodinger, LLC), unless otherwise mentioned, (c) The quaternary structure of MJ0391. Stereo view of the intact tetramer arrangement of 
MJ0391, produced by the crystallographic 2-fold axis. The tetramer is shown in an arbitrary orientation. Subunits A-D are colored red, green, 
orange and blue, respectively. A few structural elements at the tetramer interface are labeled, for clarity. 



similar to a canonical Rossmann fold, but it includes an 
antiparallel strand ((38) between the parallel strands, |35 
and |36, at the C-terminal region. Furthermore, this fold 
is modified by the extension of the |36 strand, which 
forms a long irregular |3 hairpin with the |37 and |38 
strands (Figure 2a). The canonical class I Rossmann-like 
methyltransferase fold consists of a central twisted seven- 
stranded p-sheet (|33-|32-|31-|34-|35-|38-|36), flanked by two 
bundles of helices on both sides. All of the strands are 
parallel except for the p8 strand, which is antiparallel to 
the other strands. Since the structure of MJ0391 is quite 
similar to that of MT0146/CbiT [22], the same nomencla- 
ture has been used for its description. 



Oligomeric structure 

The asymmetric unit contains two MJ0391 molecules 
(subunits A and B). The two subunits are stabilized by 
the formation of intermolecular antiparallel |3-sheets 
between |36 and |36' (Figure 2b). The crystal packing 
analysis revealed that the MJ0391 protein forms an 
intact tetramer (dimer of dimers) through the crystallo- 
graphic 2-fold axis (Figure 2c). Each subunit of the 
tetramer contributes approximately 27 intermolecular 
electrostatic interactions to facilitate tetramerization 
(Table 1). In addition to these direct intermolecular in- 
teractions, several indirect hydrogen bond interactions 
are also formed through buried water molecules at the 
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Table 1 Potential electrostatic interactions between 

subunit A and other subunits of MJ0391 

Subunit A Interacting subunits Distance 

(A) 

Chain aa Atoms Chain aa Atoms 

A Glu21 OEl B LysSO NZ 3.43 

A Glu22 OEl B Lys30 NZ 2.81 

A Lys30 NZ B Glu22 OEl 2.78 

A Leul34 N C Hisl68 0 2.87 

A Glul35 OEl C Hisl68 N 2.85 

A Glul35 0E2 C Hisl68 NDl 2.64 

A Glul35 0E2 D Asnl42 ND2 3.05 

A Asnl42 ND2 D Glul35 0E2 2.96 

A Glul45 OEl B Lysl63 NZ 3.46 

A Glul45 OEl B Hisl68 NE2 2.9 

A Glul45 0E2 B Lysl63 NZ 3.38 

A Glul45 0E2 B Hisl68 NE2 2.96 

A Alal53 0 B Alal61 N 2.86 

A Asnl55 N B Serl59 0 2.64 

A Asm 55 0E2 B Serl59 N 3.03 

A Asm 55 0E2 B Serl59 0 3.35 

A Phel57 N B Phel57 0 2.89 

A Phel57 N B Phel57 N 2.84 

A Serl59 N B Asnl55 0 2.97 

A Sen 59 0E2 B Asnl55 N 2.82 

A Serl59 0E2 B Asnl55 0 3.39 

A Alal61 N B Alal53 0 3.03 

A Hisl68 N C Glul35 OEl 2.85 

A Hisl68 N C Glul35 0E2 3.5 

A Hisl68 NE2 B Glul45 OEl 3.18 

A Hisl68 0 C Leul34 N 2.87 

A Alal72 0 C Alal72 0 2.74 



tetramer interface. The long, irregular p-hairpin, which 
is formed between the |36 and p7/p8 strands, plays a 
critical role in the tetramer formation (hereafter, referred 
to as the tetramerization hairpin (TH) region). The 
tetramerization occurs due to the contributions of the 
TH regions of each subunit, A-D. The TH region of sub- 
unit A forms intermolecular interactions with subunits 
B and C (Figure 2c). The p6 strand of subunit A forms 
an antiparallel sheet with p6' of subunit B (Figure 2b), 
while the region connecting the p7 and p8 of subunit A 
strands forms intermolecular interactions with the corre- 
sponding region in subunit C (Figure 2c). 

An intermolecular twisted p-sheet is formed between 
p6 and p6' of subunits A and B, respectively. The resi- 
dues Alal53, Asnl55, Phel57, Serl59, and Alal61 
within p6 of subunit A contribute intermolecular main- 
chain hydrogen bond interactions with the residues 



Alal61, Serl59, Phel57, Asnl55, and Alal53 of subunit 
B, respectively (Table 1). Moreover, the residues His 168 
and Alal72 of subunit A, which are located in p7, form 
main-chain hydrogen bonds with Leul34 (a6) and 
Alal72 (p7), respectively, of subunit C. 

The al helix also contacts the corresponding helix of 
its dimer counterpart. The subunit A residues Glu21 
and Glu22 form potential salt links with LysSO of sub- 
unit B. An additional salt link is also observed between 
LysSO of subunit A and Glu22 of subunit B (Table 1). 
The a6 helix of subunit A potentially interacts with 
subunits B, C and D (Figure 2c). In subunit A, GlulSS 
electrostatically interacts with Hisl68 and Asnl42 of 
subunits C and D, respectively, while Asnl42 of subunit 
A forms a salt-link with GlulSS of subunit D. Moreover, 
the Glul45 residue of subunit A also electrostatically 
interacts with Lysl6S and His 168 of subunit B. Besides 
these electrostatic interactions, about 17 residues of sub- 
unit A contribute substantial hydrophobic interactions 
toward tetramer formation (Table 2). The homologous 
protein MT0146/CbiT forms a tetramer in the crystal 
structure and in solution [22]. The buried surface area 
per monomer within the intact tetramer is about 1,907 
A^ (-21% buried surface area), as calculated by the 
program SURFACE in the CCP4 suite. Since the overall 
tetramer arrangement in the MJ0S91 structure is very 
similar to that found in MT0146/CbiT, we speculate that 
in solution, MJ0S91 may also form a tetramer that is 
functionally important for its biological activity. 

Comparison with homologous structures 

A DALI [2S] search revealed that although MJ0S91 
shares less sequence similarity with MT0146/CbiT from 
Methanobacterium thermoautotrophicum (SS% identity; 
Z-score: 27.S), the precorrin-6Y C5, CIS - methyl- 
transferase from Geobacter metallireducens GS-15 
(QS9YF0; S2% identity; Z-score: 26.0), and the putative 
precorrin-6Y C5, C15 - methyltransferase from Coryne- 
bacterium diphtheriae (Q6NHA4; 17% identity; Z-score: 
17.9) (Figure Sa), the overall tertiary structures and the 
quaternary structure arrangements are essentially 
conserved between them. The superimposition of the 
MJ0S91 structure onto the structures of MT0146/CbiT 
(PDB ID: ILSI), QS9YF0 (PDB ID: SE05; 211-405aa), 
and Q6NHA4 (PDB ID: SHM2; 2S6-410aa) yielded r.m. 
s.d. values of 1.6, 1.7, and 1.9 A, respectively, for all C" 
atoms (Figure Sb). Recently, the crystal structure of the 
C-terminal region of CobL (CobL^) from Rhodobacter 
capsulatus was reported [24]. The enzyme CobL, which 
uses precorrin-6B as a substrate, promotes the decarb- 
oxylation of the acetic acid side chain at CI 2, and the 
methylation at the C5 & CIS meso positions, to produce 
precorrin-8X. The superimposition of MJ0S91 onto the 
CobL^ structure (PDB ID: SNJR) yielded Z-score and r. 
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Table 2 Hydrophobic interactions between subunit A and 
other subunits of MJ0391 



Subunit A Interacting subunits 
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m.s.d values of 25.6 and 1.8 A, respectively, for 173 C" 
atoms (Figure 3b). 

As expected, the DALI search with MJ0391 also iden- 
tified proteins belonging to the methyltransferase family. 
For example, MJ0391 is structurally homologous to 
m^A58 tRNA methyltransferase, TrmI (PDB ID: 1054, 
Z-score: 20.6, 2.9 A r.m.s.d. for 174 C" atoms, 12% 
identity), m^U1939 23S rRNA methyltransferase, RlmD 
(PDB ID: 2BH2, Z-score: 20.0, 2.6 A r.m.s.d. for 169 C" 
atoms, 16% identity), catechol O- methyltransferase, 
COMT (PDB ID: 1JR4, Z-score: 17.8, 2.3 A r.m.s.d. for 
161 C" atoms, 17% identity), and rRNA dimethylade- 
nosine transferase, Diml (PDB ID: 3FYC, Z-score: 18.7, 
2.4 A r.m.s.d. for 153 C" atoms, 30% identity). 

Comparison with MT0146/CbiT 

Although the overall tertiary structures of the MJ0391 
and MT0146/CbiT proteins are essentially similar. 



striking differences exist in the TH region, an important 
element for forming the tetrameric arrangements 
(Figure 4a). The dihedral angles at the equivalent posi- 
tions of Lysl63 (c|), v(/: -68, 140°) and Alal66 ((|), v(/: -69, 
81°) in MJ0391 and MT0146/CbiT, respectively, trigger 
large deviations in their respective TH elements. More- 
over, the p6 & p7 strands in MJ0391 are connected by a 
type II p-turn. The type II conformation is produced by 
incorporating Prol65 at the i+1 position. The p-turn is 
stabilized by an intra-hydrogen bond between the 
carbonyl-group of He 164 (i position) and the amino- 
group of Glyl67 (i+3 position). In contrast, in MT0146/ 
CbiT, p6 and p7 are connected by a relatively long loop, 
comprising six amino acids. The abovementioned struc- 
tural deviations cause the maximum r.m.s.d value of 6.2 
A between the C" position of Prol65 in MJ0391 and that 
of Asp 168 in MT0146. The a6 helix also participates in 
tetrameric interactions (Figure 2c); however, less struc- 
tural deviation is observed between the corresponding 
regions of MJ0391 and MT0146/CbiT. In addition, 
deviations were observed in the non-specific secondary 
structure regions, such as the N-terminal region and 
the loops connecting the secondary structural elements 
(Figure 4a). 

A comparison of the apo-form MJ0391 to the 
MT0146/CbiT complexed with AdoHcy (SAH molecule, 
S-Adenosyl-L-Homocysteine; PDB ID: 1L3I) [22] re- 
vealed that the co-factor binding site regions are similar 
in both structures (Figure 4b). Hence, we predict that 
the MJ0391 protein may also recognize the cofactor 
AdoMet in a similar manner as in MT0146/CbiT. The 
shallow binding pocket is produced by the head regions 
of pi, p2 and p4, which line the middle of the pocket. 
The pocket is surrounded by the a2 and a3 helices on 
one side, and the a4 and a5 helices on the other side 
(Figure 4b). The residues within the loops connecting p4 
and a5, p2 and a3, p3 and a4, pi and a2 and in the 
N-terminal flexible region may potentially form inter- 
actions with the AdoHcy molecule. The AdoHcy inter- 
acting residues are nearly conserved between these 
proteins, except for the residues Arg91, Tyr65, LysllO 
and Asnlll, which are replaced by Asp90, Arg63, 
Glylll and Glull2, respectively, in MT0146/CbiT. 
These loops are quite flexible, as observed in Figure 4a, 
and thus we speculate that they may undergo conform- 
ational changes upon binding to the AdoHcy/ AdoMet 
molecule. 

Near the AdoMet binding site, a deep groove is 
formed by segments from the Rossmann-like methyl- 
transferase core and the TH region (of the same subunit 
and of the adjacent subunit) (Figure 2c). Keller et al 
[22] suggested that this deep groove is a putative binding 
pocket for the precorrin molecule. Surface potential ana- 
lyses revealed that the MJ0391 protein is highly basic 
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Figure 3 Comparison with CbiT hiomologous proteins, (a) Sequence alignment of MJ0391 from Methanocaldococcus jannaschii, MT0146/CbiT 
from Methanobacterium thermoautotrophicum, precorrin-6Y C5, CI 5 - methyltransferase from Geobacter metallireducens GS-15 (Q39YF0), and the 
putative precorrin-6Y C5, CI 5 - metliyltransferase from Corynebocterium diphtherioe (Q6NHA4). Tine secondary structural features from MJ0391 are 
indicated above the alignments. The colors reflect the similarity (red boxes and white characters for conserved residues; red characters for similarity in 
a group; blue frames for similarity across groups). The sequence was aligned and rendered by Clustal W [25] and ESPript [26], respectively, 
(b) Superposition of MJ0391 (yellow) on the precorrin-6Y C5, CI 5 - methyltransferase structures from G. metallireducens (PDB ID: 3E05, blue) and CobL^ 
from R. capsulatus (PDB ID: 3NJR, cyan). 



(theoretical pi: 9.01) (Figure 4c), whereas its homologous 
protein, MT0146/CbiT, is highly acidic (theoretical pi: 
4.88). Although the electrostatic charge distributions are 
quite different between these two proteins, the charge 
distributions within the putative precorrin binding 
pocket are highly conserved between them (Figure 4c). 
The residues Prol7, Arg24, Lysl62, Lysl73 and Asnl74 
are located near the binding pocket in MJ0391. The 
equivalent corresponding residues in MT0146/CbiT are 
Prol4, Arg22, Argl65, Argl76 and Asnl77, respectively 
(Figure 3). Recent studies of S. typhimurium CbiT re- 
vealed that CbiT alone can catalyze both the methylation 
at CIS and decarboxylation of the CI 2 acetate side 
chain, even in the absence of CI methylation [16]. Since 



MJ0391 is structurally highly homologous to the known 
CbiT proteins, and the functional residues are conserved 
between them, we speculate that MJ0391 may also 
possess the dual roles of methylation and decarboxy- 
lation to form the intermediate, cobalt-precorrin 8, in 
the vitamin B12 biosynthesis pathway. However, further 
biochemical and crystallographic studies are required to 
understand the enzymatic mechanism of MJ0391 in 
cobalamin synthesis. 

Conclusions 

The structure of the hypothetical protein MJ0391, from 
M, jannaschiU was determined by the MAD procedure, 
which revealed the presence of a Rossmann-like "AdoMet- 
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Figure 4 Comparison of MJ0391 with IVlT0146/CbiT. (a) Stereo view of the superposition of MJ0391 (liglit green) on MT0146/CbiT (red), for all 
C°-atoms corresponding to one subunit of the MT0146/CbiT tetramer (PDB ID: 1L3I). (b) The AdoHcy binding site region (stereo view). The 
interacting residues and the AdoHcy molecule (shown as 'SAH' in the figure) of the MT0146/CbiT complex are colored grey & cyan, respectively. 
The equivalent predicted interacting residues in MJ0391 are yellow. The Tyr65 residue is modeled as Ala, since the electron density corresponding 
to its side chain was not visible, (c) Electrostatic surface potential of the MJ0391 tetramer. The surfaces are colored blue and red for positive and 
negative electrostatic surface potentials, denoting less than -15 and greater than +15 K^, respectively, where is the Boltzmann constant and T 
is the temperature. The yellow and green arrows indicate the putative precorrin and AdoMet binding pockets, respectively. The surface potential 
was calculated with the software APBS [27] incorporated in the PyMOL program (Schrodinger, LLC). 



dependent methyltransferase" fold. The tertiary and 
quaternary structures of MJ0391 are essentially similar 
to those of the CbiT or CobL*^ proteins from M. ther- 
moautotrophicum, G, metallireducens, and R, capsula- 
tus. Moreover, the functional residues in the putative 
precorrin binding pocket are highly conserved. Hence, 
based on structural comparison studies, we speculate 
that the MJ0391 protein may possess the putative dual 



C15-methyltransferase and C12-decarboxylase activities 
in the cobalamin biosynthesis pathway. 

Methods 

Protein expression and purification 

The gene encoding the MJ0391 protein (gi:15668567) 
was amplified via PGR using Methanocaldococcus 
jannaschii DSM 2661 genomic DNA, and was cloned 
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Table 3 Summary of data collection, MAD structure 
solution, and refinement statistics 



Se-Met Se-Met Se-Met 

peak inflection remote 



Data collection 






vvaveiengtn (Aj 


n n"7on 1 

u.y/oy 1 


n n~7mo n c\c\C\C\C\ 

u.y/yzo u.yuuuu 


Space group 


D/l 0 0 




Unit cell (Aj 


u no AA ^ 01 r\ A 

a = D = 93.44, C = 0 1 .04 




Resolution (A) ^ 


bU.UU-z.z U.zo-z.zUj 




Completeness (%) 


yo.D (/o./j 


no ^ (QC\ c\ on q /Tiq /1^ 

yo.D (oy.Dj yy.j (yj.4j 


Redundancy 


1 2.2 (8.3) 


1 Z.o (y.4j 1 j.U (y.Zj 


Average 1 / o(l) 


27.4 (1 .9) 


30.3 (z.zj 33./ (z.3j 


Rmerge ( /o) 


y. 1 i4z.Uj 


/.y (43.3; /.D i4D.yj 


MAD solution 






Resolution cutoff (A) 


Z.J 




No. Se-sites found 


6 (of 8) 




iviean ruivi 


U.DZ 




Overall Z-score 


24.3 




Refinement statistics 






No. molecules in a.u. 


2 molecules 




ncbOIULIOll MllllL \r\) 


ZU— Z.J 




R-factor/Rfree 


19.8/27.3 




Mean B factor (A^) 


49.0 




No. protein residues 


360 




No. MES molecules 


1 




No. SO4 molecules 


2 




No. water molecules 


108 




r.m.s. deviations 






Bond lengths (A) 


0.013 




Bond angles 0 


1.54 




^Numbers in parentheses are values in the highest resolution shell. 

^Rmerge = ^ I'obs - <l>| / 1 <l> summed over all observations and reflections. 

3Rcryst = I||Fobs|-|Fcalc||/I|Fobs|. 

"^Rfree was Calculated with 5% of data omitted from refinement. 


into the pET-21a expression vector (Merck Novagen, 



Darmstadt, Germany). The expression vector was intro- 
duced into the E. coli BL21-CodonPlus (DE3)-RIL-X strain 
(Stratagene, La Jolla, California, USA). The recombinant 
strain was cultured in 3 L minimal medium, containing 25 
(ig/mL selenomethionine, 30 (ig/mL chloramphenicol and 
50 (ig/mL ampicillin. The harvested cells (10.7 g) were 
lysed by sonication on ice, in 20 mL of 20 mM Tris-HCl 
buffer, pH 8.0, containing 500 mM NaCl, 5 mM 
p-mercaptoethanol and 1 mM phenylmethylsulfonyl fluor- 
ide. The cell lysate was heat-treated at 363 K for 11.5 min 
and centrifuged at 14,000 g for 30 min at 277 K. The 
supernatant was desalted by fractionation on a HiPrep 26/ 
10 column (GE Healthcare Biosciences) with 20 mM 
Tris-HCl buffer, pH 8.0. The fractions of interest were 
then applied to a Super Q Toyopearl 650M column 



(Tosoh, Tokyo) equilibrated with 20 mM Tris-HCl buffer, 
pH 8.0, and eluted with a linear gradient of NaCl. The tar- 
get fractions, which eluted in 0.1 M NaCl, were applied to 
a Resource Q column (GE Healthcare Biosciences) equili- 
brated with 20 mM Tris-HCl (pH 8.0). The flow-through 
fraction was collected and applied to a Hydroxyapatite 
CHT-I column (Bio-Rad Laboratories), equilibrated with 
10 mM potassium phosphate buffer (pH 7.0), and eluted 
with a linear gradient of potassium phosphate. The target 
sample, which eluted in 207.2 mM potassium phos- 
phate, was collected and applied to a HiLoad 16/60 
Superdex 200pg column (GE Healthcare Biosciences), 
equilibrated with 20 mM Tris-HCl buffer, pH 8.0, 
containing 200 mM NaCl. The protein sample was ana- 
lyzed by SDS-PAGE and was confirmed by N-terminal 
amino-acid sequencing. After concentration to 27.9 mg/ 
mL by ultrafiltration, the protein yield was 11.7 mg from 
10.7 g of cells. 

Crystallization and data collection 

Crystallization screening was performed by the sitting- 
drop vapor-diffusion method at 293 K. Each drop 
consisted of 1.0 (iL of the Se-Met protein (28 mg/mL) 
solution and 1.0 (iL of the reservoir solution, and was 
equilibrated against 100 (iL of reservoir solution. In the 
preliminary screening, small crystals were observed in 
the crystallization conditions of 40%(v/v) MPD, 0.1 M 
Na HEPES, pH 7.0. After optimization, crystals suitable 
for X-ray data collection appeared in 50%(v/v) PEG -400, 
0.1 M MES buffer, pH 6.0, within 2 months. The crystals 
were flash-cooled in a nitrogen-gas stream at 100 K, 
using 25%(v/v) glycerol as a cryoprotectant. 

Anomalous diffraction data were collected at the 
SPring-8 (Harima, Japan) BL26B1 beamline, at three 
wavelengths, 0.97891 (peak), 0.97928 (edge) and 0.90000 
A (remote), for a Se-MAD experiment. The data sets 
were collected at 100 K, using a Rigaku Jupiter 210 CCD 
detector. The crystal-to-detector distance was set to 200 
mm. The diffraction data were integrated, reduced, and 
scaled using the HKL2000 software suite [28]. The crys- 
tallographic data and refinement statistics are shown in 
Table 3. The asymmetric unit consists of two NCS 
molecules, with a molecular mass of 21 kDa and 183 
amino acid residues per monomer, and with a solvent 
content of about 42%. 

Structure determination and refinement 

The structure determination was performed by the 
multiple anomalous dispersion (MAD) method, using 
SOLVE/RESOLVE [29]. The MAD procedure yielded six 
distinct peaks, corresponding to six of the eight sele- 
nium atoms in the asymmetric unit, with an overall 
experimental mean figure of merit of 0.62 for all data up 
to 2.5 A resolution. The initial experimental phases were 
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further improved by solvent flattening with 2-fold non- 
crystallographic symmetry (NCS) averaging, using the 
program RESOLVE [30], which increased the figure of 
merit to 071. The modified experimental map obtained 
by RESOLVE was readily interpretable, and 42% of the 
protein chains were traced automatically. Moreover, 
several rounds of manual fitting and re-fitting were 
performed using the program COOT [31]. Initially, the 
model was refined with CNS [32], using a maximum 
likelihood target that included amplitude and phase 
probability distributions with NCS restraints. In the later 
stage of the refinement cycles, the refinement was 
performed using REFMAC5 [33], incorporated in the 
CCP4 package [34]. The final refined model of the dimer 
in the asymmetric unit contains 360 residues, one MES 
molecule, two S04^' molecules, and 108 water molecules, 
with a final i^^ork of 19.8% and an Rf,ee of 27.3% at 2.3 A 
resolution. The first three N-terminal residues of the two 
chains are missing in the final structure. The side chain 
atoms of Mse4, Argl2, Argl3, Tyr65, Leu66 and LysllO 
in chain A, and Mse4, Argl2, Argl3, Tyr65 and LysllO in 
chain B are also missing in the final structure. The stereo- 
chemistry of the complex structure is good, as assessed 
with MOLPROBITY [35,36]. The Ramachandran plot 
analysis of this structure revealed that 99.4% of all residues 
were in the allowed region. The outlier residues are 
Thrl09 and Asnl74 in chain A. The electron densities 
were clear within these disallowed regions. The refine- 
ment statistics are summarized in Table 3. 

PDB Reference 

The structural coordinates of MJ0391 have been depos- 
ited in the RCSB (PDB code: 2YXD). 
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NCS: Non-crystallography symmetry; MAD: Multiple anomalous dispersion; r. 
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